# CORE.zip

CORE.zip is a supplemental file for Programming for Corpus Linguistics with Python and Dataframes (Keller, in press).

The CORE.zip archive contains CORE.pickle, a file created using Python's pickle object serialization system (protocol 5). Pickle protocol 5 requires Python 3.8 or later.

CORE.pickle is a serialized pandas DataFrame containing the Corpus of Online Registers of English (CORE). It holds the full text and metadata of the corpus in a single file. See Biber and Egbert (2018) for additional information on the corpus. See Keller (in press) for information on how to use the corpus for corpus linguistic research.


## Installation

Extract CORE.pickle from CORE.zip into the working directory of your Python session, so that the relative path `'CORE.pickle'` resolves when you load the file.
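If you prefer to do the extraction from Python rather than with a desktop unzip tool, the following is a minimal sketch using only the standard library. The helper name `extract_core` is hypothetical, and the sketch assumes that CORE.pickle is stored at the top level of the archive and that CORE.zip sits in the current working directory.

```python
import pickle
import zipfile
from pathlib import Path


def extract_core(zip_path="CORE.zip", dest_dir="."):
    """Extract CORE.pickle from the archive into dest_dir and return its path."""
    # CORE.pickle was written with pickle protocol 5, so the running
    # interpreter must support at least that protocol (Python 3.8+).
    if pickle.HIGHEST_PROTOCOL < 5:
        raise RuntimeError("Pickle protocol 5 is not supported; use Python 3.8 or later.")
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: the member is named exactly "CORE.pickle" at the archive root.
        extracted = archive.extract("CORE.pickle", path=dest_dir)
    return Path(extracted)


# Only attempt the extraction if the archive is present in the working directory.
if Path("CORE.zip").exists():
    print(extract_core())
```

Extracting into `"."` places the file where the relative path `'CORE.pickle'` will find it; pass a different `dest_dir` if you keep data files elsewhere, and adjust the load path to match.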


## Usage

```python
import pandas as pd

# reads CORE.pickle into memory and stores it in CORE
CORE = pd.read_pickle('CORE.pickle')

# displays CORE's shape
print(CORE.shape)

# displays CORE's head
print(CORE.head())
```
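Beyond `shape` and `head()`, it can help to check which columns the DataFrame has and how much memory it occupies before starting an analysis. The sketch below does this without assuming any particular column names; the helper `summarize` is hypothetical and is not part of pandas or of the book's code.

```python
from pathlib import Path

import pandas as pd


def summarize(df):
    """Return basic structural facts about a DataFrame without assuming column names."""
    return {
        "rows": df.shape[0],
        "columns": list(df.columns),
        "dtypes": {name: str(dtype) for name, dtype in df.dtypes.items()},
        # deep=True also counts the strings held in object columns
        "megabytes": df.memory_usage(deep=True).sum() / 1e6,
    }


# Only load the corpus if CORE.pickle is present in the working directory.
if Path("CORE.pickle").exists():
    CORE = pd.read_pickle("CORE.pickle")
    for key, value in summarize(CORE).items():
        print(key, value, sep=": ")
```

Because the corpus stores full texts, the `deep=True` memory figure may be considerably larger than the size of the file on disk.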

## References

Biber, D., & Egbert, J. (2018). Register variation online. Cambridge University Press.

Keller, D. (in press). Programming for Corpus Linguistics with Python and Dataframes. Cambridge University Press.